Supporting Realistic OpenMP Applications on a Commodity Cluster of Workstations

Authors

  • Seung-Jai Min
  • Ayon Basumallik
  • Rudolf Eigenmann
Abstract

This paper describes compiler techniques that translate standard OpenMP applications into code for distributed computer systems, together with optimization techniques that enhance the performance of realistic OpenMP applications. OpenMP has emerged as an important model and language extension for shared-memory parallel programming. However, despite OpenMP's success on shared-memory platforms, it is not currently being used on distributed systems. The goal of our project is to quantify the degree to which such a use is possible and to develop supporting compiler techniques. Our present compiler techniques translate OpenMP programs into a form suitable for execution on a Software DSM system. We have implemented a compiler that performs this basic translation, and we have studied optimization techniques that improve the baseline performance of OpenMP applications on distributed computer systems. Our results show that, while kernel benchmarks can show high efficiency of OpenMP programs on distributed systems, full applications need careful consideration of shared data access patterns. A naive translation (similar to OpenMP compilers for SMPs) leads to acceptable performance in only very few applications. However, additional optimizations, including computation repartitioning and access privatization, yield an average speedup improvement of 300% (ranging from 20% to 16x) on our SPEC OMPM2001 benchmarks.
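
The abstract does not show translated code, so the following is only a minimal sketch of the general idea behind running a parallel loop in SPMD style with private per-process data. It is not the authors' compiler output. The function names `block_bounds` and `spmd_sum_of_squares` are hypothetical, the block partitioning scheme is an assumption, and the processes are emulated one after another in a single Python process instead of running on a Software DSM system.

```python
def block_bounds(n_iters, n_procs, rank):
    """Return the half-open range [lo, hi) of loop iterations owned by `rank`.

    Assumed scheme: contiguous blocks, with the first (n_iters % n_procs)
    ranks receiving one extra iteration each.
    """
    base, extra = divmod(n_iters, n_procs)
    lo = rank * base + min(rank, extra)
    hi = lo + base + (1 if rank < extra else 0)
    return lo, hi


def spmd_sum_of_squares(data, n_procs):
    """Emulate an SPMD version of a parallel reduction loop.

    Each emulated process works only on its own block of iterations and
    accumulates into a private variable, so the only shared update is the
    final combine step.
    """
    partials = []
    for rank in range(n_procs):  # stands in for n_procs concurrent processes
        lo, hi = block_bounds(len(data), n_procs, rank)
        local = 0  # private accumulator: no shared-memory traffic in the loop
        for i in range(lo, hi):
            local += data[i] * data[i]
        partials.append(local)
    return sum(partials)  # one combine step after all blocks finish
```

Keeping the accumulator private until the final combine is the kind of shared-access reduction the abstract points to when it says full applications need careful consideration of shared data access patterns.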

Similar resources

Parallel Implementation of Exact Two Dimensional Pattern Matching Algorithms using MPI and OpenMP

The need for processing power is constantly increasing as more processing-demanding and time-critical applications appear. Parallel processing has emerged as an efficient and cost-effective solution, since low-end workstations and multiprocessors are cheaply available in the commodity market. In this paper, experimental results are presented on the parallel processing of the Naive, Karp and Rabi...

XML Opportunities in Real Time Immersive Simulation & Visualization Based on Clusters of Commodity Computers

Real Time Immersive Simulation and Visualization applications have traditionally been powered by high-end graphics workstations or supercomputers. But recently, clusters of commodity computers (PCs, Macintoshes, low cost workstations) have become a practical alternative. The advantages of a commodity cluster include low cost, flexibility, performance scalability and use of legacy systems. Th...

Managing Commodity Computer Cluster Oriented to Virtual Reality Applications

Virtual Reality (VR) applications have been powered traditionally by high-end graphics workstations or supercomputers. But recently, clusters of commodity computers (PCs, Macintoshes, low cost workstations) have become a practical alternative. The advantages of a commodity cluster include low cost, flexibility, access to technology, and performance scalability. The main goal of our proposal is ...

Performance Analysis of PC-CLUMP based on SMP-Bus Utilization

PC-CLUMP (Cluster of Multiprocessor) is one of the most cost-effective commodity-based platforms for HPC applications. The increasing number of CPUs per SMP node realizes very compact system size and very low price on the network interface per processor keeping the number of CPUs in the system. However, the performance of SMP-bus on such an SMP-PC node is relatively poor compared with that of SMP...

Management of Commodity Computer Cluster Oriented to Virtual Reality Applications

Virtual Reality (VR) applications have been powered traditionally by high-end graphics workstations or supercomputers. But recently, clusters of commodity computers have become a practical alternative. The advantages of a commodity cluster include low cost, flexibility, access to technology, and performance scalability. The main goal of our proposal is to offer both a High Performance Computing...

Publication date: 2003